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Introduction 


All coronaviruses (CoVs), including the causative agent of severe acute 
respiratory syndrome (SARS), encode a nucleocapsid (N) protein that 
harbors two independent RNA binding domains of known structure, but 
poorly characterized RNA binding properties. We show here that the N- 
terminal domain (NTD) of N protein from mouse hepatitis virus (MHV), a 
virus most closely related to SARS-CoV, employs aromatic amino acid- 
nucleobase stacking interactions with a triple adenosine motif to mediate 
high-affinity binding to single-stranded RNAs containing the transcrip- 
tional regulatory sequence (TRS) or its complement (cTRS). Stoichiometric 
NTD fully unwinds a TRS-cITRS duplex that mimics a transiently formed 
transcription intermediate in viral subgenomic RNA synthesis. Mutation of 
the solvent-exposed Y127, positioned on the B-platform surface of our 
1.75 A structure, binds the TRS far less tightly and is severely crippled in its 
RNA unwinding activity. In contrast, the C-terminal domain (CTD) exhibits 
no RNA unwinding activity. Viruses harboring Y127A N mutation are 
strongly selected against and Y127A N does not support an accessory 
function in MHV replication. We propose that the helix melting activity of 
the coronavirus N protein NTD plays a critical accessory role in subgenomic 
RNA synthesis and other processes requiring RNA remodeling. 

© 2009 Elsevier Ltd. All rights reserved. 
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break originating in Guangdong province in 2002.’ 
Two additional human CoVs, NL63 and HKU1, 
have since been isolated and are associated with 


Coronaviruses (CoVs) harbor very large positive 
sense RNA genomes (30 kb) and cause a range of 
upper and lower respiratory tract infections in both 
veterinary animals and humans. CoVs include the 
causative agent of severe acute respiratory syn- 
drome (SARS), SARS-CoV, which was associated 
with substantial mortality during the initial out- 
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lower respiratory tract disease but limited mortality 
in healthy individuals.*° Mouse hepatitis virus 
(MHV), on the other hand, is the prototype group 
2 coronavirus that serves as a well-studied model 
system for SARS-CoV.* 

SARS-CoV and related viruses are highly recom- 
binogenic due in part to an unusual transcription / 
replication cycle involving the synthesis of a 
8 subgenomic RNA (sgRNA) intermediates.” are 
minus-sense discontinuous transcription model,” 
well accepted model for sgRNA synthesis, poate 
lates that genome circularization, for which there is 
now genetic evidence,” is a requisite initial step. This 
enables individual transcriptional regulatory 
sequences (TRSs) in the genome body (TRS-B) to 
come in close physical proximity to the single TRS in 


0022-2836/$ - see front matter © 2009 Elsevier Ltd. All rights reserved. 


TRS Binding by CoV N Protein NTD 


the leader (TRS-L) of the 5’ untranslated region 
(UTR) during nascent strand synthesis. At each TRS- 
B sequence, the replication complex either continues 
transcription of genomic RNA template, or engages 
in a “template switch” that generates minus-sense 
sgRNAs. These can subsequently function as tem- 
plates for production of positive-sense sgRNA 
transcripts that are ultimately translated by host 
ribosomes into the structural proteins of the virus. 

Coronaviral N is a multifunctional phosphopro- 
tein’’” that plays a primary structural role in 
packaging the RNA genome into a helical ribonuc- 
leoprotein,’* as well as regulatory roles in viral RNA 
synthesis (replication and transcription), translation, 
and modulation of host cell metabolism.'*'? In situ 
crosslinking and immunoprecipitation experiments 
reveal that N interacts with multiple regions of 
positive-and negative-sense coronaviral genome and 
all sgRNAs!*'* including the 5’ leader. N-specific 
antibodies inhibit mouse hepatitis virus (MHV) RNA 
synthesis in vitro'° and N has been shown to signi- 
ficantly enhance the efficiency of RNA replication.!” 
These and other data implicate N as an important 
accessory factor in discontinuous transcription.” 
These multifunctional properties of CoV N are 
analogous to HIV-1 nucleocapsid protein'’ and 
make N an attractive antiviral target. 

A structural and mechanistic understanding for 
how CoV N protein performs its myriad functional 
roles is limited. N proteins contain two RNA- 
binding domains of known structure: an N-terminal 
RNA binding domain (NTD) and a C-terminal 
dimerization domain (CTD)*® linked by a Ser/ Arg 
(SR)-rich linker (Fig. S1). Biochemical data suggest 
that the CTD is involved in oligomerization of N 
dimers,*”** and a small-angle x-ray scattering study 
suggests that the NTD and CTD do not interact in 
the absence of RNA.* The structures of the CTD 
from avian infectious bronchitis virus (IBV) and 
SARS-CoV reveal a tightly intertwined domain- 
swapped dimer*”** with the CTD N-terminal 
region, rich in basic amino acids, implicated in 
nucleic acid binding.*’*° The structures of the NTD 
from SARS-CoV?6?” and IBV***® N have also been 
reported. The SR-rich region has been implicated in 
RNA binding in MHV? and in regulation of the 
oligomerization of SARS-CoV N* and a recent 
report provides genetic evidence for N-N interac- 
tions mediated by the NTD.”’ The reported affinity 
of N for Uz is in the 1-10 uM range (Kg) with no 
evidence for or against RNA binding specificity. 

Operating from the premise that the NTD and 
CTD fold independently into separable RNA bind- 
ing domains,*’ we show here that the isolated NTD 
makes a specific, high affinity complex with the TRS 
and efficiently melts a TRS-cTRS duplex. These are 
two necessary features of a role in stimulating 
template switching during discontinuous sgRNA 
transcription. A mutation that cripples duplex TRS 
unwinding is defective in stimulation of CoV 
replication in cell culture; these studies suggest 
that specific targeting of the N NTD may lead to new 
antiviral agents. 
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Results 


The MHV N NTD specifically binds to the 
TRS RNA with high affinity 


Since it is known that N plays an important role in 
seRNA synthesis”’ and can be crosslinked to the 5’ 
leader RNA in infected cells,"°°* we hypothesized 
that N makes a high affinity interaction with the 
TRS, a highly conserved hexanucleotide sequence 
(Fig. 1). To test this, we measured the binding 
affinity of a 5’ fluorescein (F)-labeled decanucleotide 
corresponding to the MHV TRS (F-5’-gAAU- 
CUAAAC) with N219, an N domain protein 
fragment containing the folded NTD and the 
immediately adjacent intact SR-rich region (residues 
60-219; Fig. S1), by fluorescence anisotropy. These 
data reveal that the N219-TRS complex is character- 
ized by a Kops=9.0 10’ M™ at 150 mM K™, 25 °C 
(Fig. 1, Table 1). To address the nucleotide specificity 
of NTD, we carried out fluorescence anisotropy- 
based RNA competition experiments with unla- 
beled mutant TRS RNAs (Fig. 1c; Table 1). Essen- 
tially all mutations in the TRS result in a decrease in 
Kobs, With a random RNA of the same length 
binding 53-fold_ less tightly. Substitution of 
SUCU™” with °GAG” (TRS-Y3r) results in a 
modest ~2-fold decrease in K,,, whereas complete 
replacement of the °°AAA” sequence with °°CUU”” 
(TRS-R3y) results in a 20-fold decrease. Combining 
these two blocks of mutations into the same RNA 
(TRS-YR) suggests that these two effects are not 
additive (AG.=-1.1 kcal mol), thus revealing that 
the “AAA” to °°CUU” substitution is globally 
destabilizing to the interface. Finally, the comple- 
mentary TRS sequence, 5’-AGUUUAGAUU (cIRS), 
adheres exactly to the 5’/-RRYYYRRRYY motif 
present in the TRS; consistent with this, the affinity 
of N219 for a cIRS labeled with the rhodamine 
derivative, DY547, gives Kop,=9.110’ M"™! 
(Table 1). These data taken collectively reveal that 
the MHV NTD forms a specific, high affinity com- 
plex with both the TRS and cI RS RNA sequences 
that would be present in the leader and body TRSs 
and the nascent minus-strand RNA transcript, 
respectively, during sgRNA transcription. 

Analysis of RNAs harboring successive 1-to 5- 
nucleotide deletions from the 5’ end of the TRS RNA 
decanucleotide suggest that these nucleotides up- 
stream of the °°AAA”’ motif provide electrostatic 
stabilization to the complex, with AAG,,, per loss of 
successive phosphate groups as anticipated from a 
simple polyelectrolyte binding model’? (Fig. 2 and 
Table $2). 


The SR-rich region does not engage in specific 
interactions with the TRS RNA 


Previous studies suggested that the SR-rich region 
provides most of the binding determinants for the 
specific interaction with the leader RNA in MHV.”” 
To investigate the contribution of the SR-rich region 
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Fig. 1. MHV N NID RNA binding assays. (a) The MHV 5’ leader sequence consisting of the first 72 nucleotides.*” The 
3’ most ten nucleotides containing the conserved hexanucleotide core TRS sequence (red) was used for binding assays. (b) 
Fluorescence anisotropy titrations of MHV N219 (green), N197 (magenta) and the CTD dimer (black) to 5'-fluorescein (F)- 
labeled TRS in 50 mM KP;, pH 6.0, 100 mM KCI, 25 °C. The solid lines represent best fits to a one site (N219 and N197) or 
two site (CTD) binding model (fitting models (1) and (3), respectively, Materials and Methods). (c) Dissociation of an 
N219-5’-F-TRS complex by unlabeled mutant TRS RNAs as monitored by anisotropy. RNA competitors are unlabeled 
TRS (black), TRS-A69u (red), TRS-Y3r (cyan); TRS-A68u (magenta), TRS-A70u (green); TRS-R3y, (blue); random 9mer 
(yellow). The continuous lines through the data are defined by the binding parameters obtained by analysis using fitting 
model (2) (Materials and Methods) and are compiled in Table 1. 


in TRS binding, we determined the affinity of N197,_ (Kops=1.9 + 0.110% M"™; Table 1). N197 makes a 
an NTD construct lacking the SR-rich region high affinity complex with TRS, but one character- 
(residues 60-197; Fig. S1) (Fig. 1b), for TRS RNA ized by an approximately 5-fold decrease when 


Table 1. Binding affinities of MHV N protein domains for TRS RNAs* 


N Protein RNA Sequence (5’ to 3’) n AH (kcal mol’) Kops (x10 M”) Fold decrease 
N219 TRS gAAUCUAAAC Ll 20.2 23.4 + 0.3 9.0 + 0.9 — 
N219 TRS-A70u gAAUCUAAuC 10 2 Oe -15.3 + 0.3 2.6 + 0.4 3D 
N219 TRS-R3y gAAUCUcuuC nd* nd 0.44 + 0.01 20 
N219 TRS-Y3r gAAgagAAAC nd nd 41+ 0.2 a2 
N219 TRS-YR gAAgagcuuC nd nd 1.4 + 0.7 6.4** 
N219 TRS?" AAUCUAAACU nd nd 92208 1.0 
N219 cIRS AGUUUAGAUU 11 202 —23.7 + 0.1 91+ 0.8 1.0 
N219 cTRS-R3y°" AGUUUcuuUU 09 cd 04 -19.9 + 0.3 0.54 + 0.03 17 
N219 Random10 auauagcuac nd nd 0.17 20.09 53 
N219 SARS-TRS CUUCUCUAAACGAAG L1e02 19.2205 29 20.3 rl 
N197 TRS gAAUCUAAAC nd nd 1.86 + 0.09 4.8 
Y1I27A:.N2Z19 TRS gAAUCUAAAC 1a O2 —21.0 + 0.1 0.48 + 0.06 19 
CTD dimer TRS gAAUCUAAAC ne*** ne*?? 0.03 ='0,01 300 


‘Conditions: pH 6.0, 0.15 M K*, 25.0 °C. Kops values are averaged values from direct or competition fluorescence anisotropy-based 
titrations and isothermal titration calorimetry (ITC). n and AH are reported from ITC measurements. Other fitted parameters are 
summarized in Tables S1-S4. *nd, not determined. **The pairwise cooperative (coupling) free energy, AG, was calculated from AG,= 
AAGrrs-yr - (AAG ors-R3y+ A AGprrRs-y3r), where AAG= NG aden = AGprrs- This gives AG,= —1.1 kcal molt. PAG, no heat observed. 
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Fig. 2. (a) Representative ITC titrations of N219 into wild-type 10-mer TRS (1 uM) and a 5’ truncated 6-mer (5/- 
UAAACU; 25 pM) in 50 mM K"phosphate, 100 mM KCl, pH 6.0 at 25 °C. The red line indicates the best fit according to a 
one-site binding model (see Table S2 for fitted parameters). (b) A linear dependence of all three state functions, AGops, 
AHops, and ASop, is observed on the length of the 5’-hydroxylated TRS, with AAG,ps, AAH ops, and A(-TAS ps5) of —0.8, 
~3.8 and +3.0 kcal mol per nucleotide (or phosphodiester) deletion, respectively. The fact that nucleotide substitutions of 
the same 5’ bases are far less destabilizing (see Table 1), coupled with the same energetics penalty for loss of successive 
nucleotides (or -1 charge) from the chain strongly suggests that this region of the TRS stabilizes the N219 complex 
electrostatically, with the core °° AAA” motif providing significant nonelectrostatic stabilization to the complex. 


compared to N219; this suggests that the nucleo- 
base-specific interactions are contained entirely 
within the NTD. The increase in binding affinity is 
likely due to the presence of five additional positive 
charges from the SR-rich region, contributing a 
larger electrostatic component to the binding energy 


in N219 vs. N197. To test this, standard “salt-back” 
dissociation experiments were carried out to obtain 
information on the extent to which electrostatic 
interactions stabilize the NTD-TRS complex.” 
SKops, the dependence of Kp, on [K"], for the 
N219-TRS interaction is large (SK,p,= —5.5), consis- 
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Table 2. X-ray data collection and refinement statistics for 
MHV N197 (60-197) 


MHV N residues 64-194 
Data Collection 


Space group P2;2)2; 
Cell dimensions 

a, b, c (A) 34.16, 46.86, 71.71 
Resolution 22.3-1.75 (1.78-1.75)* 
I/ol B71: (29) 
Completeness (%) 005 :(97 2) 
Redundancy o2.(3.7) 
Refinement 
Resolution (A) 22.3-1.75 (1.78-1.75) 
Reym 0.049 (0.405) 


No. reflections 


Used for refinement 11,055 

Used for Ree. calculation 1,078 
R factor (%) 18.5 
Reree (Y%) 22.6 
No. atoms 

Protein 1046 

Water molecules 127 
B-factors (A’, average) 

Protein 28.94 

Water molecules Bo.07 
r.m.s. deviations ; 

Bond lengths (A) 0.010 

Bond angles (°) 1.064 


* Values in parentheses are for the highest resolution shell. 


tent with 7-8 ionic interactions in the complex for 
the RNA binding (Fig. 52), with 55% of the total 
binding free energy at 0.15 M K*contributed by the 
polyelectrolyte effect. In contrast, the SK op, of N197 
is smaller, —3.9, with the polyelectrolyte contribution 
only ~40% under these conditions. Thus, in this 
simplified polyelectrolyte model, N219 engages in 2- 
3 additional electrostatic interactions with the RNA, 
likely contributed by a subset of the C-terminal Arg 
residues in N219 vs. N197. 


MHV NTD adopts a U-shaped £-platform 
structure 


To begin to understand the molecular determi- 
nants of the interaction between the TRS and NTD, 
we solved the crystallographic structure of MHV 
N197 (residues 60-197), using the structure of the 
SARS NTD* as a search model for molecular 
replacement (Table 2). The structural model (Fig. 3) 
encompasses residues 64-194, with only the side 
chain of K113 in the B2’-B3’ hairpin loop modeled as 
an Ala due to poor side chain density. The 130- 
residue MHV NTD adopts a U-shaped B-platform 
which contains five short B-strands (arranged B4-B2- 
B3-B1-B5) across the platform and, as expected, 
adopts a fold that is nearly identical to NTDs of other 
coronaviral N proteins.*”*°*°° The putative RNA 
binding groove is characterized by the palm of the B- 
platform and an extended B-hairpin that collectively 
contain a large number of basic and aromatic amino 
acids that are proposed to directly interact with RNA 
(Fig. 3(a)-(b)). The base of the hairpin loop is strongly 
positively charged (Fig. 3(c)), with the temperature 
factors increasing as one moves away from the 
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platform region to the tip of the 82’-B3’ hairpin 
(Fig. 3d). On the other hand, the C-terminal SR-rich 
region may effectively extend the RNA binding 
groove of N197 in N219. 


Mutations in N219 influence the TRS 
binding affinity 


We next determined the binding affinities of 
R110A, Y127A and Y129A N219s for the TRS RNA 
using our anisotropy-based assay. Although R110A 
and Y129A N219s each show only a modest 
decrease in binding affinity, characterization of the 
Y129A/R110A double mutant suggests that these 
the two residues are modestly energetically coupled 
(AG.=-0.4 kcal mol), consistent with a long- 
distance cooperativity across the B-platform (Table 
53, Fig. 53). The difference in binding of this double 
mutant to the TRS-R3y RNA relative to N219 is 
identical to that observed for the wild-type TRS 
RNA; these data suggest that Y129 and R110 are 
unlikely to make base-specific contacts with-AAA- 
sequence (Table 53). In contrast, Y127A N219 binds 
the TRS with ~ 19-fold decrease in affinity relative to 
wild-type N219. 'H-'°N HSQC spectra of Y127A 
N219 suggest only localized structural perturbations 
in the mutant (Fig. 54). 


TRS binding to N219 is strongly enthalpically 
driven 


Given the anticipated involvement of aromatic 
residue-nucleobase stacking as an important part of 
the NTD-TRS interface, we next sought to under- 
stand the underlying thermodynamic origins of the 
binding affinity by ITC (Fig. 4 and Table 54). For 
both wild-type and Y127A N219s, complex forma- 
tion is characterized by a significant enthalpic 
driving force (Table 1). Of particular note is that 
the difference in binding free energy between these 
two N219 proteins is entirely enthalpic in nature, .e. 
AAG=AAH, with A(-TAS)=0. This is consistent 
with a direct m-stacking interaction between Y127 
and one or more TRS nucleotides (Fig. 4) although 
other structural scenarios are possible. In contrast, 
the energetics of the binding of N219 to the TRS- 
A70u RNA, which harbors a single base substitution 
of the -°AAA”’-motif, reveal a significant decrease 
in the entropic penalty coupled with a vastly diffe- 
rent AH relative to the wild-type TRS RNA. This 
suggests a different mode of binding for this mutant 
RNA to N219 (Table 1). 


MHV N NTD binds tightly to the SARS-CoV TRS 


The high conservation of residues in the palm of 
the CoV NTDs**°*> and the core TRS (Fig. 1) 
makes the prediction that the MHV NTD should 
form a non-cognate complex with the SARS-CoV 
TRS. Two putative TRS sequences have been 
proposed for SARS-CoV,*””® with the first contain- 
ing the 5’-CUAAAC core observed in other CoVs 
and the second, 5’-ACGAAC, just downstream and 
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Fig. 3. Crystallographic struc- 
ture of MHV N197. (a) Ribbon 
diagram of MHV NTD shown 
with candidate RNA binding resi- 
dues (yellow). (b) The final refined 
2mF,-dF, electron density map of 
residues 124-131 is contoured at 1.5 
o to demonstrate the data quality of 
key f-platform residues in the 
model. (c) The electrostatic surface 
potential of MHV NTD shows the 
putative RNA binding region 
marked by a large electrostatic 
positive surface. (d) The ribbon 
diagram of MHV NTD shaded 
according to temperature factors of 
the Ca atoms. The color is ramped 
from dark blue (12.9-23.4 A’) to cyan 
(24.3-40.4 A’) to green (42.9-53.6 A’) 
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Fig. 4. Thermodynamic summary of MHV N NTD-TRS binding equilibria. Representative ITC titrations of (a) 10 uM 
N219 into 1 uM TRS and (b) 220 pM Y127A N219 into 10 uM TRS in 50 mM phosphate pH 6.0, 100 mM KCl and 25 °C. The 
red lines indicate the best fit to a one site binding model with the following parameters: N219, n= 0.92 (40.01), Kops= 1.4 
(+0.3)x 10’ Mt, AH= -23.3 (+0.4) kcal mol; Y127A N219: n= 1.02 (+0.01), Kops= 5-8 (40.2) x 10° M*, AH= -20.9 (+0.1) 
kcal mol"). (c) Graphical comparison of the enthalpic (AH) and entropic (-TAS) components of the free energy (AG) for 
N219 binding to TRS (open bars), Y127A N219 binding to TRS (black bars) and N219 binding to TRS-A70u RNA (see Fig. 1a) 


(hatched bars). 


overlapping the first (see Fig. 5a). Using the second 
putative TRS sequence, Baric and coworkers 
reported a ‘rewiring’ of the SARS-CoV genome by 
making parallel mutations in the TRS-L and TRS-B°? 
(Fig. 5a); however, these mutations are not expected 
to appreciably affect the binding affinity of N for the 
SARS-CoV TRS. We tested this using a 15-nucleotide 
5’-Cy3/3’-Cy5 labeled SARS-CoV TRS (Fig. 5b). By 
monitoring the anisotropy upon direct excitation of 
Cy5, we find that N219 binds to this RNA with a 
binding affinity of K,,;= 2.910’ M™ (Table 1). The 


~3-fold decrease in affinity is explained by the fact 
that the SARS-CoV TRS may exist as a weak stem- 
loop,*” giving rise to a competing equilibrium asso- 
ciated with melting the stem (Fig. 5a-b). The 
existence of the stem-loop in the doubly-labeled 
RNA was confirmed by a FRET efficiency (E) of 0.5 
(Fig. 5c), a value consistent with a hairpin-unfolded 
RNA equilibrium (Fig. 5b). Regardless, stoichiomet- 
ric N219 fully denatures this stem since E goes to 
zero. Companion ITC experiments further reveal 
that the AH,,., is 8 kcal mol™ less negative 
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Fig. 5. MHV N219 binds to the noncognate SARS-CoV TRS RNA. (a) Schematic representation of SARS-CoV leader 
(left), with core (red letters) and alternative TRS (blue letters; magenta letters, overlapping region of the two TRSs highlighted. 
Broken stem and rewired SARS-CoV TRS sequences are also shown *” (right). The green and red stars represent 5’ Cy3 and 
3’ Cy5 FRET donor and acceptor dyes, respectively. (b) Cartoon representation of how N219 drives the hairpin-ssRNA 
equilibrium toward unfolded ssRNA. (c) Plot of FRET efficiency of the SARS-CoV TRS Cy3/Cy5 pair as a function of 
[N129]/[SARS-CoV TRS] ratio. The solid line represents a fit to a 1:1 binding model defined by the parameters in Table 1 
as resolved by nonlinear least squares fit to model (4) (Materials and Methods). (d) Representative ITC titration of 37 uM 
N219 into 1 uM SARS TRS stem-loop labeled with a 5’-Cy3 and 3’-Cy5 FRET pair. Top panel, raw ITC data; bottom panel, 
concentration normalized enthalpy plotted vs. molar ratio. The red line depicts the best fit according to a one-site binding 
model where n= 1.2 (40.1), Kops= 1.9 (+0.2)* 10” M* and AH= -15.2 (+0.2) kcal/mol. 


compared to the MHV TRS (Table 1, Fig. 5d, Table 
54); this is as expected if endothermic stem melting 
is coupled to N219 binding. To verify this, a broken- 
stem mutant (bsSARS), which corresponds to two of 
the three mutations used in the rewiring study, was 
investigated along with the fully rewired TRS 
(rwSARS) RNA (Fig. 5a). The resulting increase in 
Kops and —AH,p, observed for each of these RNAs is 
consistent with N219-inducing melting of the helical 
stem in the wild-type SARS-TRS RNA (Table 54). 


N219, but not Y127A N219 or N197, efficiently 
melts a duplex TRS 


Since the N NTD makes a high affinity complex 
with both the TRS and cTRS, we hypothesized that it 
might melt an RNA duplex between the template 
TRS and nascent cIRS strand. We tested this using a 


FRET-based assay with a preformed 5’-Cy3-TRS-3’- 
Cy5-cTRS duplex RNA which is characterized by a 
FRET efficiency of ~0.90 under these conditions 
(Fig. 6a,b). Addition of N219 results in an increase in 
the Cy3 emission intensity with a concomitant 
decrease in the Cy5 emission intensity to a FRET 
efficiency of zero, indicative of complete duplex 
melting. The subsequent addition of KCI to these 
mixtures results in dissociation of the N219-ssRNA 
complexes (see Fig. 52), and full recovery of the 
FRET efficiency associated with the duplex; this 
shows that N219-mediated RNA unwinding is fully 
reversible (Fig. 55). A quantitative analysis of these 
data to an equilibrium model that explicitly invokes 
the possibility that N219 binds to the duplex (Ky, in 
Model 5, Materials and Methods) reveals an affinity 
of<1 M1, ie., this complex does not form. In contrast, 
while the CTD dimer clearly binds to this duplex, it 
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is unable to denature it, even under conditions 
where Kp, are comparable for the two domains 
(35 mM K*vs. 150 mM kK’) (Fig. 6b; Table S1). 

Strikingly, while Y127A N219 is capable of 
melting the TRS-cTRS duplex, it is strongly kine- 
tically impaired (Fig. 6c,d). In addition, there is a 
significant enhancement of the Cy5 emission inten- 
sity upon addition of Y127A N219 not observed 
with wild-type N219; this appears to be the result 
of a Y127A:dsRNA complex, since direct excitation 
of the single-stranded 3’-Cy5-cTRS RNA:Y127A 
N219 complex yields no such enhancement 
(Fig. S6). We interpret this as a ribonucleoprotein 
complex-mediated modulation of the environment 
of Cy5.*' At a saturating concentration of Y127A 
N219, we observe a rate constant of k= 8 +1x10%*s" 
or 230-fold slower than wild-type N219. Finally, for 
both Y127A N219 and WT N197, the FRET effi- 
ciency fails to return to zero even after very long 
incubation times with saturating N protein, as 
expected for the full duplex dissociation observed 
for wild-type N219 (Fig. 6b,d). This finding sug- 
gests an incomplete melting of the dsTRS RNA, 
implying a partially melted, long-lived intermediate 
complex with these two proteins (Fig. 6e). These 
observations reveal that the SR-rich tail and key 
residues on the £-platform, e.g., Y127, function 
cooperatively to melt the duplex TRS in a kineti- 
cally facile manner. 
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Fig. 6. N219 melts a TRS-cTRS 
duplex. (a) Representative fluores- 
cence emission spectra of 5/-Cy3- 
TRS/3’-Cy5-rTRS RNA duplex 
obtained on titration with N219. 
Arrows, changes in the spectra as 
N219 is added; dashed box, iso- 
sbestic point. (b) Plot of FRET E asa 
function of total protein concentra- 
tion. N219, black symbols; Y127A 
N219, green; N197, magenta and 
CTD dimer, blue. The red and 
magenta continuous lines define a 
best fit to a model that assumes that 
the NTD in each case forms a 1:1 
complex with the component single 
strands with the affinities fixed to 
their known values (Table 1), and 
the affinity for the duplex RNA 
optimized in the fit (see Materials 
and Methods, model (5)). (c) Emis- 
sion spectrum of 50 nM 5’-Cy3- 
TRS/3/-Cy5-rTRS dsRNA duplex 
(cyan line) upon addition of 12 uM 
Y127A N219 (magenta line) acquired 
at different time points. (d) Plot of 
FRET efficiency as a function of 
3) time from the data in panel (c). The 
S continuous line is a best fit to a 
single exponential dissociation 
model with k= 8 (+1)x107 s". (e) 
Cartoon model of NTD-mediated 
unfolding of the TRS-cTRS duplex 
by Y127A N219. 
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Recovery and functional analysis of Y127A and 
Y129A N-containing viruses 


To test the functional importance of these Tyr 
substitutions on viral replication, we electroporated 
BHK-R cells with Y127A and Y129A N-containing 
MHV genomes in the absence or presence of a 
“helper” RNA encoding a wild-type or mutant N 
gene. From electroporations of the Y127A N-contain- 
ing MHV RNA, we were only able to recover (18/18) 
wild-type N genes from plaque-purified virion 
particles, irrespective of whether we used wild- 
type, Y127A, or no helper RNA. In contrast, Y129A N 
viruses were recovered regardless of the presence of 
the helper RNA. 

We next tested the effect of these mutations in an 
infectious center assay. Here, cDNAs representing 
the wild-type MHV genome were in vitro trans- 
cribed and electroporated into BHK-R cells in the 
presence of either a WT, Y127A or Y129A N helper 
RNA, with the electroporated cells plated on con- 
fluent L2 cells, incubated at 37 °C for 5h to allow for 
cell attachment, and then overlaid with agarose- 
containing media. Plaques were counted 3 days later 
(Fig. 7). Although we find a modest decrease in 
infectious center formation with the Y129A N helper 
RNA relative to the wild type helper, the number of 
infectious centers formed with the Y127A N helper 
RNA is identical to that observed in the absence of 
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Fig. 7. The replication accessory function of N is 
severely impaired in the Y127A N NTD mutant relative 
to the Y129A and wild-type N. BHK-R cells electroporated 
with wild-type MHV genomic RNA in the presence of the 
indicated helper RNA were grown to confluence and 
infectious centers in L2 cells counted (mean and s.d. from 
three experiments). 


any N helper RNA. These experiments taken 
collectively provide genetic evidence that the 
Y127A N-protein containing virions are strongly 
selected against, and that the NTD, and specifically 
Y127, plays a critical role in stimulation of viral 
replication. 


Discussion 


Although the atomic resolution structures of the N 
protein NTD and CTD from several coronaviruses 
are now available*?***°”° (this work), detailed 
knowledge of the RNA binding properties of N is 
rather limited. We show here that MHV N219 forms 
a high-affinity complex with both the MHV and non- 
cognate SARS-CoV TRS” a finding that speaks to the 
conservation of the NTD-TRS interaction as a 
conserved feature of CoV replication, despite the 
distinct structural contexts of the leader TRS in each 
case. We also show that the N NTD possesses potent 
helix-destabilizing activity. The NTD employs 
enthalpically stabilizing base stacking interactions 
to drive high affinity and sequence-specific complex 
formation with the single-stranded TRS and cTRS 
RNAs. This binding, in turn, strongly enhances the 
rate of TRS-cTRS duplex melting that models an 
intermediate in sgRNA transcription by the corona- 
viral replicase complex. Full helix-destabilizing 
activity of the N NTD requires determinants on 
both the B-platform, i.e., Y127, as well as the SR-rich 
domain. Although we have not determined the RNA 
binding specificity ratio of N197 (K,,; see below), 
comparative studies of N197 and N219 suggest that 
the B-platform domain provides key specificity 
determinants for TRS recognition, with the SR-rich 
region stabilizing the complex via non-specific 
electrostatic interactions, likely with the region just 
5’ to the TRS core sequence. These findings suggest 
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that phosphorylation of $170 or T177 in MHV N,’° 
the latter of which is in close proximity to the C- 
terminal 85 strand, or Ser residues within SR-rich 
domain,’ might strongly modulate the ssRNA and 
helix-melting properties of the N protein. 

The degree to which N219 is capable of discrim- 
inating between a short TRS-containing oligonucle- 
otide vs. two "random" sequence RNAs of different 
base compositions, defined by the specificity ratio, 
Ksp=Krrs/Krandom 18 ©53 and ~25 for a 10-mer and 
9-mer RNAs, respectively. While K,, is modest 
when compared to bona fide sequence-specific RNA 
binding proteins,** it appears to be of the same order 
of magnitude determined for another viral nucleo- 
capsid protein, from HIV-1.*° Such a relatively small 
specificity ratio is not inconsistent with a N219-TRS 
binding mode that is characterized by a sizable 
electrostatic contribution to the binding energy, as 
well as the multiple functional roles N protein must 
play in the viral life cycle. It is striking, nonetheless, 
that a single Y127A substitution within the highly 
conserved WY'*’FY'*’Y sequence on the B3-strand, 
like that of a complete pyrimidine substitution of the 
triple adenosine motif °°AAA’° (in the 10-mer 
context), reduces the binding affinity to just three- 
fold above what we operationally define as non- 
sequence specific binding. In contrast, single aden- 
osine to pyrimidine substitutions within this 
°SAAA’® are not as destabilizing, but nearly 
additive (AAG=1.5 kcal mol") relative to the com- 
plete °° AAA” to °CUU” substitution (AAG=1.8 
kcal mol'), with the 3’-most A70 making the largest 
single contribution to the N219-TRS binding energy. 
These data are consistent with a model in which the 
3’ end of the TRS is anchored on the £-platform via 
enthalpically stabilizing aromatic base-stacking 
interactions with the 5’ side of the TRS held in 
place largely by electrostatic interactions that extend 
into the SR-rich tail. 

R110, Y127, and Y129 form a nearly contiguous 
surface on the B-platform, with the central residue, 
Y127, functioning as a linchpin in what appears to be 
a cooperative unit with the sum of the AAGs for any 
two single mutations (¢.g., Y127A and Y129A; 
Y127A and R110A) greater than that observed for 
the corresponding double mutant. Interestingly, an 
Ala substitution of the residue analogous to Y127 in 
IBV N, Y92A, leads to a profound reduction in viral 
replication.** The ability of N219 to facilitate the 
melting of a duplex TRS may underscore the ability 
of N to stimulate template switching during sgsRNA 
transcription, as well as function as a nucleic acid 
chaperone.'® Our functional characterization of 
Y127A N reveals that that this substitution abro- 
gates the ability of N to stimulate RNA replication, 
and the molecular origin of this defect is likely 
attributable to the kinetically crippled helix-un- 
winding activity of Y127A N. 

It is also known that coronavirus N encapsidates 
viral RNA into ribonucleoprotein (RNP) particles” 
and SARS-CoV N has been implicated in playing an 
essential role in viral RNA packaging;*© however, 
the mechanism of RNA packaging is far from clear. 


554 


A recent structural study of the SARS-CoV CTD led 
the authors to speculate that the Crp plays a a key 
role in the helical nucleocapsid" assembly.~”*° Our 
findings further suggest that the ability of the NTD 
to melt dsRNA may also play a role in RNA packa- 
ging or other steps of the viral life cycle where RNA 
remodeling is required. 

A model for how N protein-catalyzed unwinding 
of a transiently formed dsRNA between the body 
TRS and the cIRS in the nascent (daughter) strand 
might stimulate template switching during subge- 
nomic RNA transcription is shown in Fig. 8. In this 
model, template switching is an ordered unfolding 
of the TRS-cTRS duplex and subsequent hybridiza- 
tion of the nascent strand with the 5’ leader RNA. 
Biological studies in TGEV and SARS-CoV reveal 
that one or two nucleotides 5’ to the TRS core 
sequence, the core TRS itself, and <5 downstream 
nucleotides on the template strand are required to be 
identical to those in the leader TRS region for 
efficient stRNA synthesis to occur.””” This would 
optimally position key NTD recognition determi- 


RdRp 


(-) gRNA 
synthesis 5' 


NTD binding to 
TRS, cTRS 


NTD unfolding of 
TRS:cTRS duplex 


template switch 
completed 


(—) sgRNA 
synthesis 


Fig. 8. Proposed model for the role of N protein in 
subgenomic RNA transcription. The transcriptase-repli- 
case complex (labeled RdRp, green hexagon) initiates 
transcription from the 3’-polyadenylated end of the 
genomic RNA, and can either synthesize full-length (—) 
genomic RNA (upper panel), or switch templates to the 5’ 
leader at the TRS to synthesize (—) sgsRNA by copying the 
remainder of the 5’ leader (red line) (bottom panel).’ In the 
presence of an N protein dimer (purple ellipsoids 
connected by the black SR-rich linker), the two NTD 
domains bind to the TRS and cIRS sequences (orange 
lines), melt the duplex, and accelerate the rate of base 
pairing of the nascent cIRS with the leader TRS (middle 
two panels). The duplex RNA-binding activity of the CTD 
dimer may help anchor N to the nascent duplex. 
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nants, ¢.g., the triple adenosine motif, in the TRS and 
cI'RS RNAs in the middle of a TRS-cTRS duplex that 
likely forms behind the elongating RdRp.*”*” NTD- 
mediated local unfolding here would lead to an 
increase in the lifetime of the nascent strand in an 
unpaired state, thus accelerating the rate of nucle- 
ation of base pairing with core leader TRS, allowing 
the RdRp to switch RNA templates. 

It is not yet known if N interacts directly with any 
component of the polymerase complex, although 
antibodies against N strongly inhibit RNA transcrip. 
tion and N strongly stimulates virus replication.'°”° 
If accumulating structural and biochemical evidence 
for a closed-to-open conformational switch enabling 
processive elongation by viral RdRps*’*’ charac- 
terizes coronaviral RdRps as well, a direct interaction 
with N might inhibit elongation and perhaps pause 
the polymerase complex just past the 5’ end of the 
TRS, providing time for a template switch to occur. 
Efficient reconstitution of an active coronaviral 
RdRp complex on defined RNA templates”' will be 
required to test this model. 


Materials and Methods 


Preparation of RNA samples 


Unlabeled TRS and TRS mutant RNAs were obtained 
by in vitro runoff transcription using SP6 RNA polymerase 
and purified by. denaturing PAGE essentially as previ- 
ously described.** This protocol necessitates the addition 
of non-native 5’-terminal guanosine residue to some 
RNAs, denoted by the lower-case "g". All other unlabeled 
or fluorescently labeled RNAs were obtained from 
Dharmacon or IDT and purified by denaturing PAGE. 


Plasmid construction and protein expression 
and purification 


For the plasmids encoding various fragments of MHV- 
A59 nucleocapsid protein, the coding sequences were 
amplified from the full-length MHV N gene using 
standard PCR based approaches. The PCR products 
were digested by NdeI and BamHI and ligated into 
pET3a, pET15b, or pGST-parallel expression plasmids.” 
The plasmids encoding the substitution mutants were 
prepared using QuickChange PCR-based mutagenesis of 
the wild type N219 overexpression plasmid as a template. 
The integrity of all the constructs was confirmed by DNA 
sequencing. Recombinant proteins were expressed from 
their respective pET3a-N197 (residues 60-197), pET15b- 
N219 (residues 60-219) and pGST-CTD (residues 256-385) 
plasmids, in E. coli BL21(DE3)/pLysS. The growth, 
expression and purification of N fragments expressed 
from pET3a, pET15a or pGST were carried out using the 
procedures described previously”; GST-CTD proteins 
had 2 mM DIT in the buffer throughout purification. 
The GST tag was cleaved from the CTD by TEV protease 
overnight at 4 °C in 50 mM Tris, 100 mM NaCl at pH 8.0. 
CTD was separated using cation exchange and further 
purified using a Superdex G75 chromatography; the 
retention time was consistent with that of a dimer 
(28 kDa). The protein purity by inspection of Coomas- 
sie-stained 18% Tris-glycine SDS-PAGE gels was 
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estimated to be>95%. All proteins were further character- 
ized by MALDI-TOF mass spectrometry. The concentra- 
tion of purified proteins was determined using the 
calculated molar extinction coefficient at 280 nm, with 
proteins stored at —80 °C in concentrated aliquots. 


Crystallization and structure determination 


N197 overexpressed in E. coli and purified as described 
for N219 was concentrated to approximately 200 uM and 
buffered with 50 mM potassium phosphate (pH 6.0), 
100 mM KCl. Crystals were grown via hanging-drop 
vapor diffusion against 30% PEG 1000, 50 uM CAPSO (pH 
9.0) at 20 °C. Crystals grew overnight and were frozen in 
the well solution, 30% PEG 1000, 50 hM CAPSO (pH 9.0). 
Diffraction data were collected at-160 °C on an R-AXIS IV+ 
detector at Indiana University. The space group of the 
crystal was primitive orthrombic (P2,;2,;2,) with one 
protein monomer in the asymmetric unit. Diffraction 
data to 1.75 A was reduced using HKL-2000. Initial phases 
were determined using a portion of the crystal structure of 
SARS-CoV NTD (PDB: 2o0fz) as a molecular replacement 
search model in phaser.*° Iterative rounds of model 
building and refinement were carried out in Coot and 
Phenix, respectively. The N protein was then divided into 
ten segments by the TLSMD server?’ for TLS refinement. 
The quality of the final structure was verified using 
MOLPROBITY. A Ramachandran plot analysis revealed 
that 96.2% of residues are in the most favored regions and 
the remaining 3.8% of residues are found in additional 
allowed regions; no residues were found in disallowed 
regions. All structure-related figures were prepared using 
PyMOL (DeLano Scientific). 


Fluorescence anisotropy and fluorescence resonance 
energy transfer (FRET) experiments 


These experiments were typically performed on an ISS 
PC1 spectrofluorometer using 5.0 or 10.0 nM RNA 
(anisotropy) or 50 nM (FRET) RNA in 50 mM potassium 
phosphate, 100 mM KCI, pH 6.0, unless otherwise noted. 
TRS binding by N variants was measured by monitoring 
the change in the anisotropy of the labeled TRS. The 
binding of N variants to the unlabeled TRS and TRS 
mutant RNAs was followed using a standard competition 
assay. FRET experiments were carried out with RNA 
labeled with a Cy3-Cy5 pair (A,,=520 nm; Nen=550- 
700 nm; Cy3 Amax=9/70 nm; Cy5 Amax=670 nm).with the 
FRET efficiency, E, calculated from E= 1-([pa/Ip), where 
Ip is the Cy3 quantum yield and Ip, is the Cy3 quantum 
yield in the presence Cy5, following a 2-10 min equilibra- 
tion upon addition of the titrant. No change in the 
fluorescence intensity (quantum yield) of the component 
Cy3-and Cy5-labeled TRS and cTRS single-stranded 
RNAs, respectively, was observed; thus changes in I; are 
directly attributed to FRET or protein-induced fluores- 
cence enhancement (PIFE) (see Fig. S6).** Nonlinear least- 
squares fits to all binding isotherms were carried out using 
DynaFit”’ with the appropriate binding model (models 
(1)-(5), as indicated below). 


Nonlinear Least Squares Fitting Models 


Model (1): Equilibrium titration of N219 into fluores- 
cently labeled RNAs 


N219+ RNA* <> N219-RNA* Kops 
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Model (2): Competition equilibrium titration of unla- 
beled mutant RNAs into N219-labeled RNA complex 


N219+ RNA* S N219-RNA* 
(by direct titration with Model (1)) 
N219-RNA*+ RNA > RNA*+ N219-RNA eis 


Model (3): Sequential 2-site equilibrium titration of the 
CTD dimer (CTD) into labeled TRS 


K, (fixed) 


CTD+ RNA => CTD-RNA K, 
CTD+ CTD-RNA $ (CTD):-RNA K, 

Model (4): Equilibrium titration of N219 into a double- 
labeled SARS-TRS RNA hairpin using FRET 


dSARS-TRS > SARS-TRS (stem SL3 unfolding) K;,(fixed by E=0.5) 
N219+ SARS-TRS = N219-SARS-TRS Kote 

Model (5): Equilibrium titration of NTDs into a duplex 
TRS-cTRS FRET pair 


TRS+ cTRS = dsTRS (TRS annealing) K, (fixed) 
N219+TRS > N219-TRS K, (fixed) 
N219+ cTRS > N219-cTRS K; (fixed) 
N219+ dsTRS = N219-dsTRS Ky 


Isothermal Titration Calorimetry 


Isothermal titration calorimetry experiments were car- 
ried out using a MicroCal VP-ITC calorimeter. In a typical 
experiment, 20 1M protein was titrated into 1 uM RNA in 
50 mM K*phosphate, pH 6.0 and 100 mM KCl] at 25.0 °C, 
unless otherwise noted. All experiments were carried out 
in triplicate and the averaged values reported. Best fits 
were generated using a single site binding model described 
previously.” 


Recovery and Characterization of Mutant Viruses 


The cDNA in vitro assembly reverse genetic system 
described previously” was used to generate viral genomes 
containing the N Y127A and Y129A mutations. To generate 
mutant viruses, cDNAs representing the entire MHV 
genome were constructed by sequential ligation of the A- 
G cDNA fragments as described previously.?*°’ The 
ligated cDNAs representing mutant or wild type N gene- 
containing MHV genomes were in vitro transcribed and 
electroporated into BHK-R cells in the presence of a wild- 
type or mutant N gene transcript as previously described.”” 
Cultures were observed for up to 72 h for the development 
of cytopathic effect (cell fusion) and harvested by freezing 
at —70 °C. The recovered viruses were plaque isolated and 
expanded on DBT cells. Total RNAs were extracted using 
QIAGEN RNeasy kit. The entire N gene of each plaque 
isolate as well as their 5’ and 3’ UTRs were sequenced to 
verify the genotype of the recovered viruses. 


Accession codes 


Atomic coordinates and structure factors of MHV N 
NTD have been deposited in the Protein Data Bank under 
accession code 3hd4. 
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